AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Optical Character Recognition

# Optical Character Recognition

NVLM D 72B
NVLM 1.0 is a series of cutting-edge multimodal large language models that achieve state-of-the-art results in vision-language tasks, comparable to leading proprietary and open-access models.
Image-to-Text Transformers English
N
nvidia
14.33k
769
Trocr Large Str
TrOCR is a Transformer-based optical character recognition model designed for single-line text images, fine-tuned on multiple standard datasets.
Text Recognition Transformers
T
microsoft
571
17
Trocr Small Stage1
TrOCR is a Transformer-based pre-trained optical character recognition model that adopts an encoder-decoder architecture, suitable for OCR tasks on single-line text images.
Image-to-Text Transformers
T
microsoft
3,713
12
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase